Document AL1 
Appl. No. 09/829,066 



WORLD CKTELLECTUAL PROPERTY ORGANIZATION 

International Buiccn 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PAT ENT COOPERATION TREATY (PCT) 

(11) International Publication Number WO 95/20682 

(43) International PnWkatton Date: 3 August 1995 (03.0855) 



(£1) International Patent Cto wifi ra ti on °" : 
C12Q 1/58 



i 



(21) international Application 

(22) Inicrnatioeal Filing Date: 



PCT/US95/01200 
30 January 1995 (30.01.95) 



(30) Priority Data: 
08/189.240 



31 January 1994 (31.01.94) US 



(60) Parent Application or Grant 
(63) Related by Ccntinnatian 
US 

Bled on 



0&/1S9.240 (CTP) 
31 January 199* (31.01.94) 



/TnAonlfcant (for all designated States except US): THE 
RKENT? OF THE UNIVERSITY OF CALIFORNIA 
[US/US* 22nd floor. 500 Lakeside Drive, Oakland, CA 
94612 (US). 

rUS/USl; 231 Uke Drive, KetStngton, CA 94708 (US). 
MYTELKA, Daniel [US/US]: 1709 Sbattucfc Avenue, #202. 
Berkeley. CA 94709 (US). 



(74) Agents: KRUSE, Norman. J. et al.; Townsend and Townsend 
Khourie and Crew, Steuart Street Towei, 20tb floor. One 
Market Plaza, San Francisco, CA 94103 (US). 



«1) Designated States: AM. AT. AU. BB. BG. BR, h\. CA, CH. 

W Ta DE DK. EE ES, FL GB. GE HU, JP, KE KG. 
KP, KR. KZ, LK, LR, LT, LU, LV, MD. MG, MN, MW, 
MX, NL, NO, NZ, PL. PT, RO t RU, SD, SE SI. SK, TJ. 
TT. UA, US. UZ, VN. European patent (AT BE, CH, DE 
DK. ES, FR, GB. GR, IE. IT, LU, MC, NL, PT. SE), OAP1 
patent rBF. BX. CF, CG, CI, CM. GA. GN. ML. MR, NE 
SN. TD, TG), ARIPO patent (KE MW, SD, SZ). 



Published 

Wiih inurnaticnal search report. 



(54) Title: METHODS FOR THE ELIMINATION OF DNA SEQUENCING ARTIFACTS 
(57) Abstract 

» *s srrffiK USSR s AVS2s£Ett~32Sz 

of stopj arpaasw m cham "■JJ"*™*?^ e^Znfen aUo ptcvldK for DNA sequencing to centals these cc*porxris. 
KSt^Kil^t^^P^ «^ «<A polymerase., such * polymerase Cam reaction 



♦ 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify Stales 
applications under the PCT. 



AT 


Autre 


At 


Manilla 


SB 


Barbadot 


BE 


Bdgiam 


BF 


BufcxaaFaio 


fiC 


Bulgaria 


BJ 


» ■ • 


BR 


Brazil 


BY 


Belaru 


CA 




CT 


Caotnl African RrpobOc 


CG 


Canto 


CH 


Switzerland 


a 


Cfite dTvoire 


CM 




CN 




cs 


Czechoslovakia 


cz 


Oecfa Rqufetic 


D£ 


Gcrasnx 


DK 


Devnaxk. 


£5 


Spain 


n 


Rated 


FR 


TfSflOC 


CA 


Gfbcn 



to the PCT on tae front pages 



GB 


United Kingdom 


GE 


Geoifb 


GN 


Guinea 


GR 


Graccft 


htj 


Hungary 


IE 


tehnd 


IT 


Italy 


JP 


lap an 


EE 


Kenya 


KG 


Kyifyctm 


KP 


Democratic People's RrpuWit 




of Korea 


KR 


Republic of Korea 


KZ 


Ki^atiumt 


U 


Ueclnaattia 


LX 


Sri Lanka 


LU 


Laxembonrg 


LV 






Monaco 


MD 


Republic of Motion 


MG 


Madagaacar 


ML 


Mali 


M* 


Mongolia 



pamphlets publishing international 



MA 


MurOxnLa 




Malivi 


NX 


Ni|er 


HL 


Nettertanda 


WO 




NX 


NvwZetaud 


PL 




PT 




RO 


Rcnjnia 


RU 


Rmsan retention 


SD 


Sad«fi 


S* 


Swedes 


51 


Sbvtoia 


SK 


Stanfcia 


SN 


Senegal 


TD 


Cbad 


TC 


Togo 


TJ 


Taffidsian 


TT 


Ttfiutod ad Tobago 


CA 


Ukraine 


US 


United Suits of America 


uz 


Usbrictscao 


VN 


VletNro 



WO 95/20682 



1 



PCT/US9S/0H0O 



METHODS FOR THE ELIMINATION 
OF DNA SEQUENCING ARTIFACTS 

This invention was made with Government support 
under Grant (Contract) No. GH12010 awarded by the National 
Institutes of Health (NIH) . The Government has certain rights 
to this invention. 

BACKGROUND OF THE INVENTION 
The present invention relates primarily to 
improvements in methods of DNA sequencing. In particular, the 
invention relate* primarily to the elimination of stops or 
pauses in chain termination methods of DNA sequencing by the 
addition of nitrogen-containing organic compounds such as 
betaine, trimethyl amine N-oxide and dimethylglycine. The 
invention also provides for DNA sequencing kits containing 
these compounds. The invention also provides for improvements 
> in other laboratory procedures using DNA polymerases, such as 
polymerase chain reaction (PCF) . 

Efficient DNA sequencing technology is very 
important to the development of the biotechnology industry as 
well as for basic biological research, improvements in both 
5 efficiency and accuracy of DNA sequencing are needed to keep 
pace with the demands for DHA sequence information. The Human 
Genome Project, for example, has set a goal for dramatically 
increasing the efficiency, cost-effectiveness and throughput 
of DNA sequencing techniques. (See Collins, F. , and Galas, D. 

30 (1993) Science 262:43-46.) 

Most DNA sequencing today is carried out by a chain 
termination method of DNA sequencing. The most popular chain 
termination methods are variants of the dideoxynucleotide- 
nediated chain termination method of Sanger (see Sanger et al . 
35 (1977) Proc Nat. Acad. Sci., USA 74:5463-5467). Thousands of 
laboratories employ this technique including those doing 
automated sequencing for the Human Genome Project. Commercial 
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kits containing the reagents needed for this method of DNA 
sequencing are available and are widely used. 

Although commonly used, the Sanger (dideoxy) 
sequencing technique has problems and limitations . One of the 
major problems with this method is the incidence of DNA 
polymerase stops or pauses which interfere with the 
determination of the DNA sequence. Stops are predominantly - 
problematic in regions of the DNA that are GC-rich or in 
regions that are especially far from the primer. In addition, 
stops occur more frequently in impure DNA preparations. 
Because of this, DNA purification is generally required before 
DNA can be sequenced by the dideoxy method. 

Various methods have been proposed to eliminate 
stops in dideoxy sequencing. For example, researchers have 
tried varying the reaction temperature, using a variety of DNA 
polymerases, stabilizing the DNA polymerase, and extending the 
prematurely terminated DNA molecules with terminal 
deoxynucleotidyl transferase (see T.W. Fawcette and S.G. 
Bartlett (1990) BioTecfcnigues 9:46-48; D. Pisa-Williamson and 
C.W. Fuller (1992) United States Biochemical Corp. Comments 
19:29-36; J. Sambrook, E.F. Fritsch and T. Maniatis, ed. 
(1989) Molecular Cloning: A Laboratory Manual, second edition, 
Cold Spring Harbor Press, Cold Spring Harbor, N.Y.; and F.M. 
Ausubel, R. Brent, R.Z. Kingston, D.D. Moore, J.G. Seidman, 
J. A. Smith and K. Struhl, ed. (1989) Current Protocols in 
Molecular Biology, Greene Publishing Associates and wiley- 
interscientific, John Wiley and Sons, New York.) However, 
none of these methods has been reliable. There is a 
continuing need to eliminate the problem of stops in. DNA 
sequencing and thereby improve the efficiency and cost- 
effectiveness of this important process. 

SUMMARY OF THE INVENTION 
This invention provides for methods of decreasing 
the incidence of DNA polymerase stops that occur in a reaction 
mixture containing a DNA polymerase. The incidence of DNA 
polymerase stops is decreased by the addition of nitrogen- 
organic molecules, which are described herein, to 
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the reaction mixture. These compositions of the invention are 
added to the reaction mixture in an amount that is effective 
to decrease the incidence of DNA polymerase stops. T&g^ 
reaction »ixture^anj>^^ 

5^u^clnir^e7erably the chain termination **** * 
^T^quencT^is a dideoxynucleotide DNA sequencing method. 
The reaction mixture can also be a PGR reaction mixture. The 
DNA which is sequenced or amplified can be purified or 

Unpurified DNA can be present, for example, as a 
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20 



25 



30 



35 




10 crude cell lysate. 

The method of decreasing the incidence of DNA 

polymerase stops can be combining, in an aqueous solution, a 

f 1 — a mixture 



DNA molecule, a DNA polymerase or DNA polymerases 
of deoxyribonucleoside triphosphates, a chain elongation 
inhibitor, and one or more of the compositions of the 
invention to form a reaction mixture. The DNA polymerase is 
capable of producing a nucleic acid complementary to a portion 
of the DNA molecule by using the DNA molecule as a template. 
The reaction mixture is incubated to allow the DNA polymerase 
to form nucleic acid fragments of varying length by using the 
DMA molecule as a template. The nucleic acid fragments that 
are formed are complementary to the DNA molecule that is being 
sequenced. 

in some variations of the above method, a nucleic 
acid primer is also added to the reaction mixture. This 
primer is complementary to a second portion of one strand of 
the DNA molecule that is located downstream from the first 
portion of that strand. The DNA polymerase that is used is 
capable of extending the 3 ■ end of the primer to produce a 
nucleic acid that is complementary to the first portion of the 

DNA molecule. 

The invention provides improved methods for 
sequencing a DNA molecule by the chain termination method, 
wherein the DNA molecule is combined with a DNA polymerase 
capable of producing a nucleic acid complementary to a portion 
of the DNA molecule by using the DNA molecule as a template; 

with a mixture of deoxyribonucleoside triphosphates; and with 
chain elongation inhibitor, to form a reaction mixture. This 
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reaction mixture is incubated to permit the DNA polymerase to 
form nucleic acid fragments of varying length by using the DMA 
molecule as a template, wherein the nucleic acid fragments are 
complementary to the DNA molecule. The improved part of this 
method is the addition to the reaction mixture of an amount of 
one or more of the nitrogen-containing organic compounds 

described herein. 

The invention also provides for Xits for DNA 
sequencing by the chain termination method. These kits have 
instructional material, a container which contains one or more 
deoxyribonucleoside triphosphates, a container which contains 
a chain elongation inhibitor, and an amount of the 
compositions of the invention. The Xits may also contain 
other components. Preferably, the kits are for DNA sequencing 
by the dideoxy DNA sequencing method. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1A. DNA polymerase stops in dideoxy DNA 
sequencing. Dideoxy DNA sequencing of double-stranded DNA was 
carried out as described in example 1 herein, except that 
water was substituted for betaine (results were identical with 
or without water) . The reaction mixture was separated on a 
polyacrylamide/urea DNA sequencing gel as described in example 
1. The positions of DNA polymerase stops are shown by arrows. 

Figure IB. Elimination of DNA polymerase stops in 
dideoxy DNA sequencing by betaine. Dideoxy DNA sequencing was 
carried out in the presence of betaine as described in example 
1. The reaction mixture was separated on a 

polyacrylamide/urea DNA sequencing gel as described in example 
l. The arrows show the positions corresponding to the DNA 
polymerase stops of figure 1A. 

DEFINITIONS 

The term "nucleic acids", as used herein, refers to 
either DNA or RNA. It includes plasmids, infectious polymers 
of DNA and/or RNA, nonfunctional DNA or RNA, chromosomal DMA 
or RNA and DNA or RNA synthesized in vitro (such as by the 
polymerase chain reaction) . "Nucleic acid sequence" or 
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20 



25 



30 



"polynucleotide sequence" refers to a single- or double- 
stranded polymer of deoxyribonucleotide or ribonucleotide 
bases read from the 5' to the 3' end. 

The term "DNA molecule" as used herein refers to DNA 
B olecules in any form, including naturally occurring, 
recombinant, or synthetic DHA molecules. The term includes 
plasmids, bacterial and viral DMA as wall as chromosomal DNA. - 
The term encompasses DNA fragments produced by cell ly-U or 
subsequent manipulation of DNA molecules. Unless specified 
otherwise, the left hand end of single-stranded DNA sequences 

is the 5« end. t 

The term •upstream" when used in respect to a single 

stranded DNA molecule, refers to those DNA sequences located 
5. to a reference region of the DNA molecule. For example, 
those sequences located upstream from a defined portion of a 
DNA molecule are those located 5« to the defined portion. 
Similarly, the term "downstream", when used with respect to 
single stranded DNA molecule, refers to those DNA sequences 
located 3» to a reference region of the DNA molecule. 

The term -complementary" as used herein refers to a 
relationship between two nucleic acid sequences. One nucleic 
acid sequence is complementary to a second nucleic acid 
sequence if it is capable of forming a duplex with the second 
nucleic acid, wherein each residue of the duplex forms a 
guanosine-cytidine (G-C) or adenosine-thymidine (A-T) basepair 
or an equivalent basepair. Equivalent basepairs can include 
nucleoside or nucleotide analogues other than guanosme, 
cytidine, adenosine, or thymidine, which are capable of being 
incorporated into a nucleic acid by a DNA or BNA polymerase on 
a DHA template. A complementary DNA sequence can be predicted 
from a known sequence by the normal basepairing rules of the 
DNA double helix (see Watson J.D., et al. (1987) nolecular 
Biology or the Gene, Fourth Edition, Benjamin Cummings 
publishing Company, Wenlo Park, California, pp. 65-93) 
complementary nucleic acids »ay be of different sizes, 
example, a smaller nucleic acid may be complementary to a 
portion of a larger nucleic acid. 



For 
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The terra "DNA template" or "template" as used 
herein, refers to a DNA molecule or portion of a DNA molecule 
that is used by a DNA or RNA polymerase to determine the 
sequence of a newly synthesized nucleic acid. (See Watscn. 
5 j.d., «tal. f supra for a detailed description of the action 
of DNA and RNA polymerases on template DNA molecules) . 

The term "replicating", as used herein in reference- 
to a DNA polymerase, refers to the process of using a DNA 
»olecule as a template to produce a nucleic acid complementary 
10 to a portion of the DNA molecule. 

The terms "purified DNA" or "purified DNA molecule," 
as used herein, refers to DNA that is not contaminated by 
other biological macromolecules , such as RNA or proteins, or 
by cellular metabolites. Purified DNA contains less than 5% 
15 contamination (by weight) from protein, other cellular nucleic 
acids and cellular metabolites. The terms "unpurified DNA- or 
-unpurif ied DNA molecules- refer to preparations of DNA that 
have greater than 5% contamination from other cellular nucleic 
acids, cellular proteins and cellular metabolites. Unpurified 
20 DNA may be obtained by using a single purification step, such 
as precipitation with ethanol combined with either LiCl or 
polyethylene glycol. The term "crude cell lysate preparation" 
or "crude cell lysate" or "crude lysate" refers to an 
unpurified DNA preparation where cells or viral particles have 
25 been lysed but where there has been no further purification of 
the DNA. 

Some compounds of the invention may be present with 
a positive or negative charge or with both a positive and 
negative charges, depending on the pH of the solution. It is 
30 understood that these various forms of these compounds are 
included in the present invention. 

The term betaine, as used herein, refers to N,N,N- 

tr imethy lglyc ins . 

All numerical ranges in this specification are 
35 intended to be inclusive of their upper and lower limits. 
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DETAILED DESCRIPTION 




a number of different methods of 
determining the sequence of DNA molecules. Most methods in 
5 common use today are variations on one of two general methods. 
One general scheme uses chemical reagents which react with 
specific bases to allow other chemicals to cleave the 
phosphodlester backbone at those points. The second general 
method uses a polymerase enzyme to produce DNA fragments 
10 complementary to the DNA molecule to be sequenced. The 

inclusion of low amounts of chain elongation inhibitors in the 
reaction mixture causes different sized DNA fragments to be 
generated. The DNA fragments generated by either chemical 
methods or chain termination methods can be separated and 
IS analysed by a variety of methods. For example, they can be 

separated on a sequencing gel based on size. By examining the 
bands present at any position on the gel, the sequence of the 
DNA molecule can be determined at the corresponding position. 
For example, the top of the sequence noted in Figure 1A. 
20 contains three consecutive 6 nucleotides, since bands are 
present at that position in only the S lane. 

Chain termination methods of DNA sequencing are much 
more widely used than chemical cleavage methods. 
Dideoxynucleoside triphosphates are the most common chain 
25 terminators used in DNA sequencing. For examples of specific 
protocols for dideoxy DNA sequencing, see Sambrook, et al., 
supra. 

in spite of its wide use, there are technical 
problems associated with the dideoxy sequencing method, one 
30 of the major problems is the incidence of stops or pauses of 
the DNA polymerase which interfere with the determination of 
the DNA sequence. When the DNA polymerase stops at a 
particular site, bands appear in all four lanes of the 
sequencing gel, and the nucleotide residue at that position 

35 cannot be determined. 

DNA polymerase stops occur for several reasons. 
Stops are observed more frequently as the enzyme extends 
farther from the 3' end of the primer. This phenomenon 
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the length of DMA fragment that can sequenced at one tine. 
Reducing the incidence of stops allows longer pieces of DNA to 
be sequenced and therefore makes DHA sequencing more 

efficient. . 
5 DHA polymerase stops also occur more frequently when 

contaminants are present in the DNA preparation. Because of 

this phenomenon, DNA preparations must be purified prior to _ 

sequencing. The improved DNA sequencing method of the 

invention decreases the incidence of stops and therefore 

10 allows less pure preparations of DMA to be reliably sequenced. 

This, in turn, increases the efficiency and reduces the cost 

of DNA sequencing. 

Lastly, even under optimal conditions, DNA 
polymerase stops are often seen in GC-rich regions of the DNA. 

15 This makes DNA sequence information in certain areas of a DNA 
molecule difficult to determine. This problem is, of course, 
compounded when one is trying to determine a DNA sequence in a 
GC-rich region far from the 3- end of the primer or in a DNA 
preparation that has not been optimally purified. Thus, the 

20 improved method of the invention increases the accuracy and 
efficiency of DNA sequencing while reducing its cost. 

The reduced incidence of DNA polymerase stops caused 
by the compounds of the invention also has utility in 
procedures other than DNA sequencing. Numerous common 

25 laboratory techniques, such as PCR {polymerase chain 

reaction) , in vitro mutagenesis, nick translation, reverse 
transcription and blunt ending utilize DNA polymerases. The 
addition of appropriate concentrations of the compounds of the 
invention can potentially increase the efficiency or speed of 

30 




r eminds th - t * h » incidence of W* P°tt» 

St( 



The compounds used in the present invention are 
35 nitrogen-containing organic molecules that are capable of 

eliminating or reducing the incidence of stops occurring in 
chain-termination methods of DNA sequencing and in other 
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laboratory procedures using DHA polymerases, such 
These compounds are represented by the formula: 



R2-H-X CD 

wherein: ~ 

R 1 , R 2 , and R 3 may be the same or different and are 

independently selected from the group consisting of hydrogen, 
methyl, ethyl, and propyl, with the proviso that no more than 
two of R 1 , R 2 , and R 3 hydrogen; and 

X is a moiety selected from the group consisting of: 

radicals of the formulas 
(a) =0 ; and 



R* 



I S 
(b) -CH-(CH 2 ) n -R 



wherein: 



R 4 is selected from the group consisting of methyl, 
hydrogen and, when combined with R 1 ' forms a 
pyrrolidine ring; 

R 5 is selected from the group consisting of -C0 2 H 
and -S0 3 h; and 

n is an integer of from 0 to 2; and 
with the proviso that, when R 1 and R« form a pyrrolidine 
ring, no more than one of R 2 and R 3 is hydrogen, and 
wherein the composition is added in an amount effective 
to decrease the incidence of stops. 

When a pyrrolidine ring is formed by R 1 and R 4 ' a 
compound of formula II is formed. 



82 _k_J_ (CH^R 5 (II> 
R 3 



in certain preferred embodiments, the methods and 
kits of this invention use compounds of formula I wherein R 1 , 
R 2 and R 3 are the same or different and selected from the 



WO 95720682 PCT/OSW01200 

10 

group consisting of methyl, ethyl and hydrogen with the 
proviso that no more than two of R 1 , R 2 and R 3 are hydrogen 
and f when R 1 and R 4 form a pyrrolidine ring, no more than one 
of R 2 and R 3 are hydrogen. 

In another group of preferred embodiments, the 
methods and kits of this invention use a compound of formula I 
wherein X is ~CH 2 C0 2 H. Further preferred embodiments within - 
this group use compounds where R 1 , R 2 and R 3 are methyl; where 
H 1 , R 2 are methyl and R 3 is hydrogen; or where R 1 is methyl 
and R 2 and R 3 are hydrogen. 

In further preferred embodiments, the methods and 
kits of this invention use a compound of formula I wherein X 
is «o and R 1 , R 2 and R 3 are methyl. 

In still further preferred embodiments, the methods 
and kits of this invention use a compound of formula I wherein 
R 1 and R 4 form a pyrrolidine ring, R 2 and R 3 are methyl, n is 
0, and R 5 is -C0 2 H (stachydrine, formula III), 



HC-Q-COOH (I") 

CK 3 



In yet another group of preferred embodiments, the 
methods and kits of this invention use compounds wherein R 1 , 
R 2 , and R 3 are methyl and X is -CH 2 -S0 3 H (sulfobetaine) - 

In general, the compounds used to eliminate or 
reduce the incidence of DKA polymerase stops in DNA (sequencing 
and in other laboratory procedures using a DNA polymerase are 
commercially available. For example, betaine, 
dimethylglycine, sarcosine, and trimethylamine N-oxide can all 
be obtained from Sigma Chemical Company (St. Louis, Missouri, 
USA) . 

These compounds may also be synthesized by routine 
methods known to those of skill in the art. For example, 
compounds of formula wherein R 4 is K, n is 0 and R 5 is -C0 2 H 
can be synthesized by the method of Lloyd, et al. (1992) J* 
Pbarm. Pharmacol • 44s 507-511. In general, ethyl 
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chloroacetate is heated to reflux with the appropriate 
tertiary amine in ethanol. When the reaction is complete, the 
ethanol is removed f roB the reaction mixture by evaporation 
under reduced pressure. The residue is dissolved in 3-6* w/v 
5 aqueous HCl and varmed to reflux. Evaporation of the solvent 
under reduced pressure provides the desired products. 
Typically, these products can be recrystallized from an 
acetonitrile/water mixture. 

Compounds of formula I wherein R« is H or CH 3 , n is 
10 l and R 5 i. C0 2 H can b. synthesized by the method of Fiedorek, 
F.T., U.S. Patent Mo. 2,548,428. In brief, betalactones are 
reacted with tertiary amines to provide the desired compounds. 

Compounds of formula I wherein R is H, n is 2, and 
R 5 is -C0 2 H can be synthesized by the method of Aksnes, G., et 
15 al. J. Chem. Soc. London 1959:103-107. In brief, 4- 

bromobutyric acid (Aldrich Chemical Co., Milwaukee, Wisconsin, 
USA) is converted to a methyl ester by treatment with methyl 
alcohol and catalytic sulfuric acid. Subsequent treatment of 
the methyl ester with excess alcoholic tertiary amine provides 

20 the desired compounds. 

compounds of formula I wherein R and R are taken 
together to form a pyrrolidine ring and where R 5 is C0 2 H are 
synthesized by the general method of Karer, et al. (1925) 
Helv. cnim. Acta. 8: 364. For example, stachydrine is formed 
25 by the methylation of proline, according to this procedure. 

Compounds of formula I wherein X is -o are 
synthesized by oxidation of the corresponding tertiary amines 
(see March, J. (1992) Advanced Organic Chemistry, Reactions, 
mechanisms and Structure, fourth Edition, John Wiley and Sons 
30 New York, pp. 1200-1201) . Typically, the oxidation is carried 
out with hydrogen peroxide, but other peracids may also be 

used* . 

Sulfobetaine can be synthesized according to the 

procedure of King, J.F.. et al. (1985) J. Phosphorus Sulfur 
35 25: 11-20. Other compounds of formula I wherein R is -S0 3 H 

can also be synthesized by modifications of this procedure and 
by other methods known to those of skill in the art. 
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y r? mvfld »etfr " ?g for DNA sequencing, polymerase chain 
;ref rt i Qn and ot * er laboratory procedures using Pflft 

Chain termination methods of DNA sequencing 
The present invention encompasses an improvement in 
DNA sequencing by the chain termination method. As used 
herein, the terms "chain termination method of DNA 
sequencing," "chain termination method" or "chain termination 
DNA sequencing method" refer to a DNA sequencing method that 
uses a DNA polymerase to produce nucleic acid fragments 
complementary to a portion of the DNA molecule to be 
sequenced. Generally, a primer is used that is complementary 
to a portion of the DNA molecule. The primer is extended 
along the DNA molecula template by the DNA polymerase. The 
principle of the method is that low amounts of specific chain 
elongation inhibitors are included in the reaction mixture so 
that the DNA polymerase will only occasionally incorporate an 
inhibitor and terminate. Generally, four reaction mixtures 
are set up, each with a different chain elongation inhibitor, 
capable of specifically terminating at a guancsine, cytosine, 
adenosine or thymidine residue. The DNA fragments generated 
by incubation with the DNA polymerase can be separated and 
analyzed to determine the sequence of the DNA molecule. 
Typically, the fragments are separated by gel electrophoresis 
and detected by autoradiography, although other separation or 
detection methods may also be used. (See Sanger, et al., 
supra and SambrooX, et al., supra, for a more detailed 
description of the chain termination method of DNA 
sequencing.) This procedure is amenable to automation, and 
various specific methods have been created for that purpose, 
including single lame sequencing, laser detection and 
capillary electrophoresis. 

The terms "chain elongation inhibitors" or "chain- 
terminating inhibitors", as used herein, refer to compounds 
that terminate nucleic acid chain elongation by a DNA 
polymerase enzyme. As described above, these compounds may be 
useful in chain termination methods of DNA sequencing. 
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The terms "pause", "stop", "DNA polymerase pause- or 
"DNA polymerase stop- as used herein refer to a phenomenon 
wherein the DNA polymerase stops or does not function at a 
particular nucleotide when the DNA polymerase is incubated 
5 with DNA in a chain terminating DNA sequencing method. This 
is a phenomenon in which DNA polymerase molecules fail to 
continue elongation despite the presence of the next 
nucleotide to be incorporated and the absence of any manifest 
reason why elongation should cease (such as the incorporation 
10 of a dideoxynucleotide) . When the DNA fragments are displayed 
on a DNA sequencing gel, stops appear as bands in all four 
lanes of the gel at a position corresponding to the length of 
the fragment that the DNA polymerase molecules had difficulty 
elongating, so the identity of the nucleotide at that position 
15 cannot be determined (see arrows in Figure 1A for examples) . 

There are a variety of different chain termination 
methods for sequencing DNA. The most commonly used method is 
the dideoxynucleotide DNA sequencing method. The terms 
-dideoxynucleotide DNA sequencing method" or "dideoxy DNA 
20 sequencing method", as used herein, refer to a chain 

termination method of DNA sequencing wherein the chain 
elongation inhibitors are 2'3 '-dideoxynucleosides or their 
derivatives . Typically , 2 • 3 • -dideoxynucleoside triphosphates 
are used. A variety of other chain elongation inhibitors may 
25 also be used. For example, arabinonucleoside derivatives or 
3 • o-methyl deoxyribonucleotide derivatives may be used in 
chain termination methods. (See Sanger, et al., supra and 
Axelrod, V.O., at al. (1978) N.A.K. 5:3549-3563.) 

A variety of different enzymes can be used to 
30 produce DNA fragments complementary to the DNA molecule to be 
sequenced. For example, T7 DNA polymerase, Taq polymerase, 
and the Klenow fragment of DNA polymerase I are all used in 
DNA sequencing. UNA polymerase and reverse transcriptase have 
also been used. DNA polymerases may be genetically or 
35 physically altered to optimise the enzyme for use in DNA 
sequencing. 

The DNA polymerases require a primer sequence. The 
primer sequences may be generated by digestion of the DNA 



WO 95720682 



PCT/US95/01200 



14 

molecule or may be added exogenously as a primer molecule. 
The terns "primer molecule" , "primer" or "nucleic acid primer" 
or "DNA primer" , as used herein, refer to a single-stranded 
nucleic acid molecule which is complementary to a portion of 
the DMA molecule to be sequenced. Primers are allowed to 
anneal to a DNA molecule, so that subsequent elongation of the 
3' end of the primer by a DNA polymerase produces a nucleic ~ 
acid sequence complementary to a portion of the DNA molecule 
to be sequenced. Primers are commonly used in chain 
termination methods of DNA sequencing. However, in certain 
modifications of DNA sequencing by the chain termination 
method, primer sequences may be generated by digestion with 
appropriate enzymes (see below). 

In an alternative embodiment, DNA sequencing vectors 
may be used to facilitate DNA sequencing by chain termination 
methods. A variety of such vectors may be used, including 
H13, Bluescript vectors such as pBS-KS + and pBS-KS~ 
(Stratagene, San Diego, California, U.S.A.), pDC vectors such 
as pUC18 and pUC19 and pBR322 vectors. The DNA molecule to be 
sequenced is inserted into the sequencing vector. Primers can 
be constructed that are complementary to a region of the DNA 
in the sequencing vector, so that elongation from the 3 1 end 
of the primer will produce DNA fragments complementary to 
portions of the DNA molecule to be sequenced . Thus, when 
sequencing vectors are used, the DNA molecule to be sequenced 
can be present as an insert in the vector. Primers that are 
complementary to a region of the DNA sequencing vector can be 
used, so that chain elongation occurs from the 3 ■ end of the 
primer into the region of the DNA insert, thereby allowing 
determination of the DNA sequence of the insert. 

The DNA preparation for sequencing may be either 
purified or unpurif ied» Unpurif led DNA may be a crude cell 
lysate* The compounds of the invention that eliminate or 
reduce the incidence of DNA polymerase stops allow for the use 
in DNA sequencing of unpurif ied DNA preparations, including 
crude cell ly sates. 

The DNA preparation may be present in single- 
stranded or double-stranded form. For instance, linear 
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double-strand.* DMA »ay be generated by PGR. When such DNA is 
sequenced, the incidence of DNA polymerase stops is reduced or 

eliminated . . 

The mixture of DNA fragments generated in chain 

termination methods of DMA sequencing is most commonly 
separated and analyzed by gel electrophoresis. See figure 1A 
and IB herein for an example of a DMA sequencing gel. Other _ 
separation methods useful for separating DNA fragments may 
also be used (see Sambrook, et el., supra). For instance, 
D mass spectrometry can be used to analyze mixtures of DHA 

fragments (see Jacobson, et ml. (1991) J. Genetic Analysis, 
Techniques and Applications 8:223-229. 

The DHA fragments may be labeled or unlabeled. Tor 
example, unlabeled DHA fragments may be detected by 
5 ultraviolet spectroscopy in an automated technique or by 
silver staining after gel electrophoresis. However, it is 
generally desirable to produce labeled DNA fragments to 
facilitate their detection. For instance, radiolabeled DHA 
fragments may be produced. They may be produced, for example, 
►0 by use of a radioactive primer. Alternatively, radioactive 
nucleotides may be incorporated during the incubation with a 
DNA polymerase. When radiolabeled DNA fragments are produced, 
they may be separated by gel electrophoresis and visualized by 
autoradiography (see figures 1A and IB, herein) . A wide 
25 variety of non-radioactive labels may also be used including 
fluorophores, chemilumine scent agents and small detectable 
molecules, such as biotin. Per example, in automated 
sequencing techniques, fluorescent or eclorimetric labels can 
be detected by laser spectroscopy using capillary tubes or 

30 sequencing gels. 

Non-radioactive labels may be attached by indirect 

means. Generally, a ligand molecule (e.g., biotin) is 
covalently bound to the molecule. The ligand then binds to an 
anti-ligand (e.g., streptavidin) molecule which is either 
35 inherently detectable or covalently bound to a signal system, 
such as a detectable enzyme, a fluorescent compound, or a 
chemiluminescent compound. A number of ligands and anti- 
ligands can be used. Where a ligand has a natural anti- 
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ligand, for example, biotin, thyroxine, and Cortisol, it can 
be used in conjunction with the labelled, naturally occurring 
anti-ligands. Alternatively, any haptenic or antigenic 
compound can be used in combination with an antibody. 

There are a variety of modifications of the chain 
termination method of DNA sequencing. For example, DNA 
sequencing may be performed manually, by semi-automated 
procedures or by the use of an automated DNA sequencing 
instrument. In addition, chain termination DNA sequencing 
technology may be combined with other techniques. For 
example, in "exometh" methods, a DNA molecule is treated with 
exonuclease to generate a series of DNA molecules with single- 
stranded 5- extensions of varying length. These DNA molecules 
can then be sequenced directly by a chain termination method 
15 of DNA sequencing without the addition of an exogenous primer. 
(See Sorge, J.A. , et al. d»89) Proc. Nat. Acad. Sci, USA 86: 
9208-9212 for a more detailed description of DNA sequencing by 
the "exometh" modification.) Such modifications of chain 
termination DNA sequencing methods are all encompassed by the 

20 present invention. 

The compounds of the invention may be added to the 
reaction mixture before or during the incubation of the DNA 
with the DNA polymerase to eliminate or reduce the incidence 
of stops. The order of addition of the DNA molecule, the DNA 

25 polymerase, the chain elongation inhibitors, the 

deoxyribonucleoside triphosphates, the primer, and the 
compounds of the invention may also be varied. As 
demonstrated in Example 4, the compounds that reduce or 
eliminate the incidence of stops may even be added after the 

30 normal incubation time for the DNA polymerase, and the 

incubation may be extended to remove DNA polymerase stops. 



p^Y^j-,.** ehatr t *- eaetion Procedures 
The present invention also encompasses an 
35 improvement in nucleic acid amplification procedures, such as 
PCR, which involve chain elongation by a DNA polymerase. 

There are a variety of different PCR techniques 
which utilize DNA polymerase enzymes, such as Taq polymerase. 



WO 95/20682 



PCT/US95/01200 



17 



See PCR Protocols: A Gaide to Methods and Applications. 
(Innis, M, Gelfand, D. , Sninsky, J. and White, T. , eds.), 
Academic Pres., San Diego (1990) for detailed description of 
PCR methodology. In a typical PCR protocol, a target nucleic 
acid, two oligonucleotide primers (one of which anneals to 
each strand) , nucleotides, polymerase and appropriate salts 
are mixed and the temperature is cycled to allow the primers _ 
to anneal to the template, the polymerase to elongate the 
primer, and the template strand to separate from the newly 
synthesized strand. Subsequent rounds of temperature cycling 
allow exponential amplification of the region between the 
primers. The primers can anneal to both the original template 
and the newly synthesized nucleic acid, as long as the 
polymerase is able to extend at least as far as the position 
to which the other primer anneals. For this reason, the 
ability of PCR to amplify a product is primarily limited by 
the ability of the polymerase to extend the annealed primer. 
The ability of the polymerase to extend the annealed primer is 
dependent on the distance between the primers and the 
nucleotide composition of the sequence between them. The 
addition of an appropriate amount of one or more of the 
compounds described herein facilitates this elongation. See 
example 8 and example 9, herein, for a demonstration of the 
effects of betaine on chain elongation in PCR protocols using 
Tag polymerase. Other compounds of the invention in addition 
to betaine can be used in a similar manner to improve chain 
elongation in a variety of different PCR methods. 

The compounds of the invention also have utility in 
reducing DNA polymerase stops in laboratory procedures other 
than DNA sequencing and polymerase chain reactions. Numerous 
common laboratory techniques, such as in vitro mutagenesis, 
nick translation, reverse transcription and blunt ending 
utilize DNA polymerases. See Sambrook, et al. (1989) 
molecular Cloning: A Laboratory Manual, second edition, Vol. 
1-3, Cold spring Harbor Press, Cold spring Harbor, N.Y. (1989) 
for a description of these methods. Thus, the compounds of 
the invention are useful in a variety of procedures involving 
chain elongation by a DNA polymerase. 
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T » ffr i« g an* »^imizafi"" a* coTtmnnndfl that reduce or 

stops 




compounds capable of eliminating or reducing the 
incidence of stops in chain termination methods of DNA 
sequencing and in other laboratory procedures using DNA 
polymerase are described above. These compounds may be tested 
for their relative ability to eliminate or reduce DNA 
polymer ase stops. For instance, the compounds may be tested 
by the procedure described in Example 1, herein. Briefly, DHA 
sequencing is carried out by a dideoxy sequencing method in 
the presence or absence of a selected concentration of a 
compound of the invention. The DHA sequencing gels obtained 
in the presence and absence of the compound are then compared 
to determine the effectiveness of the compound in eliminating 
DHA polymerase stops (see for example, figure 1A and IB) . 
Effective concentrations for each of the compounds may be 
determined by this procedure. Optimal concentrations for a 
given compound may vary for the different variations of chain 
termination DHA sequencing methods. These concentrations may 
be readily determined experimentally by adding different 
amounts of a compound and determining the incidence of stops 
(see for instance. Example 7, herein). 

This invention also encompasses kits for DHA 
sequencing by the chain termination method which comprise an 
amount of one or more of the compounds described herein that 
eliminate or reduce the incidence of DNA polymerase stops. 
The kits may further comprise instructional material, a 
container which contains one or more deoxyribonucleoside 
triphosphates, a container which contains a DNA polymerase, 
and a container that contains a chain elongation inhibitor. 
The containers in the kit may be combined in various ways. 
Per example, a chain elongation inhibitor may be combined in 
the same container with the mixture of deoxyribonucleoside 

triphosphates. 

This invention also encompasses kits for DHA 
polymerase chain reaction which comprise an amount of one or 
more of the compounds described herein that eliminate or 
reduce the incidence of DHA polymerase stops. The kits may 
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further comprise instructional material, a container which 
contains a DNA polymerase, and a container that contains 
deoxyribonueleoside triphosphates. 

Unless defined otherwise, all technical and 
scientific terms used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which this 
invention belongs. Although any methods and materials similar 
or equivalent to those described herein can be used in the 
practice or testing of the present invention, the preferred 
methods and materials are now described. All publications 
mentioned hereunder are incorporated herein by reference. 
Unless mentioned otherwise, the techniques employed or 
contemplated herein are standard methodologies well known to 
one of ordinary skill in the art. The materials, methods and 
examples are illustrative only and not limiting. 



EXAMPLES 



Betaine eliminates stops that are routinely observed 
in a standard didaoxy sequencing procedure. 

Betaine monohydrate was obtained from Sigma and 
stored as a 5.5M stock solution at -20-C. Modified T7 
polymerase (Sequenase 2.0 (Amersham, Arlington Heights, 
Illinois, U.S.A.) ) and nucleotide mixes were purchased from 
usb. 35 S-dATP was purchased from Amersham. 

DMA sequencing generally followed the protocol of 
Dei sal et ml. (1989) BioTechnigues 7:514-519. Sequencing was 
performed by mixing 1 picomole (pmole) of supercoiled double 
stranded plasmid DNA (purified by running over a QIAGEN 
tip-100 column according to the manufacturer's specifications) 
vith 2-4 pmole of various oligonucleotide primers (generally 
purified by polyacrylamide gel electrophoresis, though this 
proved unnecessary for primers under 30 nucleotides) and 
adjusting the volume to 10 Ml containing 0.1 N HaOH. The tube 
was incubated for ten minutes at 68 then it was -vedto 
room temperature and 4 ul of TDMN (200 mM Naci, so mM DTT, 120 
mM HC1, 80 mM MgCl 2 , 280 mM TES) were added. After a further 
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ten minute incubation at room temperature, 2 /il of GTP 
labeling mix (7.5 *M dCTP, 7.5 *iH dGTP, 7.5 /iM dTTP) and 5 M Ci 
of 35 S-dATP were added, followed by 2 m! of Sequenase (diluted 
1S8 in cold 10 mM Tris.Cl pH 8/0, 1 n* EDTA) . This mix was 
incubated for five minutes at room temperature, then 3.5 /il 
were aliquoted to each of four tubes preheated to 37 °C, each 
containing 3.5 Ml of 5.5 M betaine and 2.5 ^1 of one of the - 
termination mixes (80 fM each dOTP, 8 mH one ddNTP, 50 mM 
Naci) , for a final concentration of approximately 2M betaine. 
This was incubated for five minutes at 37 *C, then stopped with 
4 nl of stop solution (80* deionized formamide, ix TBE, 0.05% 
xylene cyanol, 0.05% bromophenol blue.) After 2.5 minutes at 
95«C, the tubes were briefly chilled on ice. 4.5 /il were 
loaded on a 6% polyacrylamide (19:1 acrylamide: 
bisacrylamide) /7M urea/lX TBE gel, which was run for various 
periods at 35W. The gels were fixed with a 12% methanol/12% 
acetic acid solution, transferred to Whatman paper, and dried. 
They were generally exposed overnight on Kodak XAR 5 film. 

The addition of betaine at a concentration of 2M 
eliminated virtually all stops when DKA sequencing was 
performed using a modified form of the method of Del Sal et 
al., supra, as described above. Sequencing without bataine 
leads to occasional stops r which tend to occur in CC-rich 
regions and in regions particularly far from the primer (Pig. 
1A). An analysis of seventeen such stops indicates that they 
tend to occur after the middle position of sequences similar 
to pyrimidine-guanine-cytosine, with eleven of seventeen stops 
examined matching this consensus and the other six differing 
at only one position. There was no obvious correlation with 
potential secondary structure of the single stranded template, 
as determined by the program MF0LD, part of the GCG package. 
In the presence of betaine (Fig. IB) , all of these stops 
disappear, allowing the correct sequence to be determined. 
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p *"f H gb?e nw * sequences farther f; 

obtained In the prpfl*ncg of tet^ne 
The incidence of stops in DMA sequencing by the 
dideoxy method increases in regions of the DHA located farther 
5 from the primer. This phenomenon limits the useful DNA 

sequence that can be obtained from one primer. However, the 
inclusion of betaine in the dideoxy sequencing reaction 
mixture decreases the incidence of stops in DKA regions far 
downstream from the primer. ThiB allows useful sequence 
10 information to be obtained in these regions of the DNA without 
doing an additional sequencing reaction with another primer. 

DNA sequencing was performed in the presence and 
absence of 2N betaine as described in example 1. The 
resulting sequencing reactions were run on a DKA sequencing 
15 gel as described in example 1, except that the gel was run for 
12 hours to examine longer DNA fragments. These longer DNA 
fragments clearly showed the sequence of DNA regions farther 
from the primer than those routinely obtained in dideoxy DNA 
sequencing procedures. Little useful sequence information 
20 could be determined from these gels in the absence of betaine. 
However, in the presence of betaine, DNA sequence could be 
read up to at least 520 nucleotides from the 3' end of the 
primer, with sequences beyond that becoming less readable 
because of the resolving power of the gel. 



Example 3 



unnurif ii 



prfpn-ratlonff j" »"f presence of betaine 
DNA was prepared generally following a standard 
alkaline lysis technique (See Silhavy et al. (1984) 
30 Experiments With Gene Fusions, Cold Spring Harbor Laboratory, 
cold Spring Harbor, New York 147-148) , followed by an ethanol 
precipitation. No organic extraction, RNAse treatment or 
other further purification technique was used. XLl-blue cells 
(Stratagene, San Diego, California) containing the pDSM3 
35 plasmid were used as a source of the DNA. The DNA was then 

sequenced as described for example 1. As expected, there were 
a large number of stops in the absence of betaine. However, 
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the stops present while sequencing this DNA disappeared when 
betaine was added to the termination reaction. 

Example A ft i m\ nation nf stops dwrinq DNA sequencing by 

phasing y jtT? betaine 

Betaine is capable of eliminating stops when added 
after the normal termination reaction. DNA sequencing was - 
carried out as described in example 1, except that no betaine 
was added with the termination mixture. After the 5 minute 
incubation at 37 either 3.5 j*l of 5.5 betaine solution or 
3.5 nl at water was added. The incubation was then continued 
for an additional 5 minutes and was stopped with stop solution 
as in example l. Addition of betaine at this point greatly 
reduced the stops observed in the absence of betaine. Thus, 
the bands representing, stops on the sequencing gels appear to 
represent halted DNA polymerase complexes. Betaine, added 
after the normal incubation with the termination mixes, is 
capable of allowing the halted DNA polymerase complexes to 
resume elongation. This demonstrates that betaine is not 
simply acting by stabilizing the DNA polymerase, since its 
addition at this point would have no effect if that were the 



Example 5 pi filiati on or reduction of the incidence Of 

ff * lQp e in modif ied dideoxv DNA sequencing 
procedures 

The OKA sequencing procedure of example 1 was 
altered to demonstrate that betaine is able to eliminate stops 
that occur in a variety of dideoxy DNA sequencing protocols. 
The effect of substituting a single-stranded DNA substrate or 
using different DNA polymerases in a dideoxy DNA sequencing 
procedure was examined. 

i)_ fiemiencihq a* sing le-stranded DNA 

Single stranded DNA was prepared from strain 
XLl-blue containing plasmid pDSH6, according to routine 
procedures. When single stranded DNA was sequenced by the 
procedure described in example 1, fewer stops were observed in 
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the absence of betaine than in double-stranded specimens. 
However, these stops were also eliminated by betaine. 




5 DNA sequencing was carried out essentially by the 

procedure described in example 1, except that the Klenow 
fragment of Z. coll DNA polymerase was used. 7.5 units of the 
Klenow enzyme was used and the extension mix was modified to 
include 20 MCi of 35 S-dATP and 0 . 1 Ml of DTT in addition to 

10 the GTP extension mix. Stops were observed in different 

places with the use of the Klenow enzyme but were generally 
eliminated by betaine, just as described above for the 
procedure using Sequenase. 

tina Tap polymerase enzyme 



15 £l 

DNA sequencing was carried out according to the 

promega »C»»1" DNA Sequencing System* protocol (Promega 
Corporation, Madison, Wisconsin, USA) . The protocol is 

described below. 
20 (1) End label primer. 

(2) Mix 4-40 fmol DNA template, 5 Ml 5x fmol 
sequencing buffer (250 mM Tris.Cl pH 9.0, 10 fflM MgC12) , 1.5 
pinole labeled primer, betaine and water to a final volume of 
16 Ml containing an appropriate concentration of betaine 

25 (e.g., 2M) . 

(3) Add 1 Ml sequencing Grade Taq DNA Polymerase (5 

u/mD 

(4) Add 4 Ml ot the above to each of the dNTP/ddNTP 
mixes according to the following table: 
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15 



20 







A 


C 


G 


T 


dATP 




20 mK 


20 JIM 


20 /XH 


20 MM 


dCTP 




20 MK 


20 MM 


20 MM 


20 MM 


7 DeazJ 


1 dGTP 


20 MM 


20 MM 


20 MM 


20 MM 


dTTP 




20 M» 


20 MM 


20 MM 


20 MM 


ddATP 




350 MM 








ddCTP 






200 MM 






ddGTP 








30 MM 




ddTTP 










600 MM 




(5) 


Overlay with on« 


drop of mineral oil 



95 a C 



(6) Place tubes in thermal cycler preheated to 



(7) 



Start appropriate cycling program (varies with 
one example is 95 °C for 2 minutes, 30 cycles of (95 °C 

for 30 sec, 42 # C/30 sec, 70*/60 sec.))* 

(8) Add 3 pi stop solution (10 mM NaOH, 95% 
formamide, 0.05% bromophenol blue, 0.05% xylene cyanol) • 

(9) Heat at 95° C for 2% minutes, ice quench, and 
load on a sequencing gel. 

Sequencing gels were run and the presence of DNA 
polymerase stops was determined as described in Example 1, 
herein. The incidence of stops observed in the reaction 
mixture was reduced in the presence of betaine. 



25 Example 6 Eliminatio n of stops during DNA sequencing bv 

DlnethYlolvcine . sar cosin e fmonomethvlalvcine) 
and trijaethvl amin e N-oxide fTMANOl 
Three additional compounds were demonstrated to 
decrease the incidence of stops in a dideoxy DNA sequencing 
30 procedure. Dinethylglycine, ear cosine (mononethylglycine) and 
trimethylamine N-oxide (TMAKO) (all from Sigma Chemical 
Company) were evaluated for their ability to eliminate stops 
in a dideoxy DNA sequencing method, as described in Example 1. 
DNA sequencing was otherwise performed as described in example 
35 1- TMANO proved to eliminate stops better than betaine, while 
dimethylglycine was somewhat less effective. Sarcosine had a 
measurable effect and is less preferred than the other 
compounds. Two other N-substituted charged compounds, 
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25 
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25 

tetraethylammonium chloride (TEACl) and tetraethylammonium 
acetate (TEAAc) were also tested. TEACl inhibits the DNA 
polymerase. TEAAc did not inhibit the DNA polymerase as much 
as TEACl, but also did not reduce the number of stops observed 
in the sequencing 9*1* 

•anaes of jne. TMANO. 

Ij^hvlalvcine effective in 

dldeoxv j n*»t*l od of DNA 

PY" thesis 

The concentration ranges of TMANO, sarcosine, 
diaethylglycine and betaine were determined according to the 
procedure described in Example 1. 

The results are summarized in the following table. 






m -oducts 

Betaine was shown to extend the maximum distance 
between primers for chain elongation in PCR. PCR was 
performed on T7 DNA as described in B. Krummel, Ph.D. Thesis, 
University of California, Berkeley, 1990. Approximately 50 
paole of each primer were combined with 2.5 Units of AmpliTag 
Polymerase (Perxin-Elmer, Norwalk, Connecticut. USA) and 0.5 
M g of phage T7 DNA in a 200 /tl reaction containing final 
concentrations of 200 (M dNTP, 50 mM KC1, 10 mM Tris-Cl pH8.3, 
2.5 mM MgCl 2 and 0.01% gelatin. Following a 1' hot start at 
94-C, 25 cycles of PCR were performed with cycle times of 90" 
at 94»C, 1» at 37«C, and 20' at 72-C. The products were 
precipitated, run on a 1% agarose gel and visualized with UV 

following ethidium staining. 

The addition of betaine at a concentration of 2M 
allowed two primers which were separated by approximately 4000 
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base pairs to amplify the intervening DNA region- This DNA 
region was not appreciably amplified in the absence of 
betaine. 

Example 9 PCR amplif ication of regions containing 

grflfr1fTO*** e PNA sequences 
Betaine was shown to allow PCR chain elongation 
through CC-rich DNA regions which have impeded chain 
elongation in the absence of betaine. PCR was performed on a 
plasmid containing multiple repeats of the sequence TGC, which 
we have shown to impede the progress of DNA polymerases (see 
Example 1, herein) - Although the primers were only 500 base 
pairs apart , a distance which can normally be easily amplified 
by pgr i the standard reaction conditions failed to produce any 
amplified product. When betaine was added to a concentration 
of 2M, the intervening region was amplified. 

It is understood that the examples and embodiments 
described herein are for illustrative purposes only and that 
various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included 
within the spirit and preview of this application and scope of 
the appended claims. All publications, patents, and patent 
applications cited herein are hereby incorporated by 
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WWAT TS C ^,ftTMED IS; 



1. A method of decreasing the incidence of DNA 
polymerase stops occurring in a reaction mixture containing a 
DHA polymerase comprising adding to the reaction mixture an 
amount of a composition of the formula: 

R l 



wherein: 

r 1 , R 2 , and R 3 may be the same or different and are 
independently selected from the group consisting of hydrogen, 
methyl, ethyl, and propyl, with the proviso that no more than 
two of R 1 , R 2 , and R 3 are hydrogen; and 

X is a moiety selected from the group consisting of: 
radicals of the formulas 
(a) =0 ; and 



I s 



wherein: 

R* is selected from the group consisting of methyl 
and hydrogen and, when combined with R 1 , forms a 
pyrrolidine ring; 

R s is selected from the group consisting of -C0 2 H 
and — S0 3 R; and 

n is an integer of from 0 to 2; 
with the proviso that, when R 1 and R 4 form a pyrrolidine 
ring, no more than one of R 2 and R 3 is hydrogen, and 
wherein the composition is added in an amount effective 
to decrease the incidence of DNA polymerase stops. 



2. The method of claim 1 wherein R , R* and R are 
the same or different and selected from the group consisting 
of methyl, ethyl and hydrogen with the proviso that no more 
than two of R 1 , R 2 and R 3 are hydrogen and, when R 1 and R form 
a pyrrolidine ring, no more than one of R a and R 3 is hydrogen. 



WO 95/20682 



PCT/US93/01200 



28 

3. The method of claim 2 wherein X is -CH 2 C0 2 H. 

4. The method of claim 3 wherein R 1 , R 2 and R 3 are 

methyl. 

5. The method of claim 3 wherein R 1 , R 2 are methyl 
and R 3 is hydrogen. 

6- The method of claim 3 wherein R 1 is methyl and 
R 2 and R 3 are hydrogen. 

7. The method of claim 2 wherein X is «0. 

8. The method of claim 7 wherein R 1 , R 2 and R 3 are 

methyl. 

9. The method of claim 2 wherein R 1 and R 4 form a 
pyrrolidine ring, R 2 arid R 3 are methyl, n ie 0, and R 5 is 
-C0 2 H. 

10. The method of claim 2 wherein R 1 , R 2 , and R 3 
are methyl and X is -CH 2 S0 3 H. 

11. The method of claim 1 wherein said reaction 
mixture is a reaction mixture for a chain termination method 
of DNA sequencing. 

12. The method of claim 11 wherein said chain 
termination method of DNA sequencing is a dideoxy DNA 
sequencing method. 

13. The method of claim 1 wherein said reaction 
mixture is a PCR reaction mixtxire. 



14. The method of claim 1 wherein said reaction 
mixture comprises unpurified DNA. 
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! 15. The method of claim 14 wherein said unpurified 

2 DNA is a crude cell lyeate. 



1 



16. A method of decreasing the incidence of DNA 

2 polymerase stops in a chain termination DNA sequencing method 

3 comprising the steps of: 

4 a) combining in an aqueous solution, a DNA 

5 molecule; a DNA polymerase capable of producing a nucleic acid 

6 complementary to a portion of said DNA molecule by using the 

7 DNA molecule as a template; a mixture of deoxyribonucleoside 

8 triphosphates; a chain elongation inhibitor; and an amount of 

9 a composition of the formula: 



10 
11 

13 ! 



16 



R 1 



R2-N-X CD 



14 

15 wherein: 



R 3 



R 1 , R 2 , and R 3 may be the same or different and are 

17 independently selected from the group consisting of hydrogen, 

18 methyl, ethyl, and propyl, with the proviso that no more than 

19 two of R 1 , R 2 , and R 3 are hydrogen; and 

20 X is a moiety selected from the group consisting of: 

21 radicals of the formulas 

22 (a) -0 ; and 

23 R* 

25 (b) -CH-(CH 2 ) n -R s 

26 wherein: 

27 r* is selected from the group consisting of methyl 

28 and hydrogen and, when combined with R 1 , forms a 

29 pyrrolidine ring; 

30 r 5 is selected from the group consisting of -C0 2 H 

31 and -S0 3 H; and 

32 n is an integer of from 0 to 2; 

33 with the proviso that, when R 1 and R 4 form a pyrrolidine 

34 ring, no more than one of R 2 and R 3 is hydrogen, and 

35 wherein the composition is added in an amount effective 
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1 
2 



to decrease the incidence of DNA polymerase stops, to 
form a reaction mixture; and 

b) incubating the reaction mixture to permit 
the DNA polymerase to form nucleic acid fragments of 
varying length by using the DMA molecule as a template, 
wherein said nucleic acid fragments are complementary to 
said DNA molecule. 

17. The method of claim 16 wherein said chain 
elongation inhibitors are 2 • , 3 .-dideoxyribonucleoside 
triphosphates . 

18. The method of claim 16 wherein R l , R and R 
are the same or different and selected from the group 
consisting of methyl, ethyl and hydrogen with the proviso that 
no more two of R 1 , R 2 and R 3 are hydrogen and, when R^ and R 
form a pyrrolidine ring, no more than one of R 2 and R is 
hydrogen . 

19. The method of claim 18 wherein X is -CH 2 C0 2 H. 

20. The method of claim 19 wherein R 1 , R 2 and R 3 
are methyl. 

21. The method of claim 19 wherein R 1 and R 2 are 
methyl and R 3 is hydrogen. 

22. The method of claim 19 wherein R 1 is methyl and 
2 R 2 and R 3 are hydrogen. 



l 23 

1 24 

2 are methyl. 



The method of 18 wherein X is =0. 

2 ' 

The method of claim 23 wherein R 1 , R and R- 



25. The method of claim 18 wherein R 1 and R 4 form a 
pyrrolidine ring, R 2 and R 3 are methyl, n is o, and R 5 is 



3 -C0 2 H. 
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! 26. The method of claim 18 wherein R 1 , R 2 , and R 3 

2 are aethyl and X is -CH 2 S0 3 H. 

1 27. The method of claim 16 wherein said DNA 

2 molecule is unpurif ied DHA. 

1 28. The method of claim 27 wherein said unpurif ied. 

2 dna is a crude cell lysate. 

! 29. The method of claim 16 wherein step a) further 

2 comprises the addition of a primer complementary to a second 

3 portion of said DNA molecule, said second portion of the DNA 

4 molecule located downstream to said first portion of the DNA 

5 molecule, wherein said DNA polymerase is capable of extending 

6 the 3« end of said primer to produce a nucleic acid 

7 complementary to said first portion of the DNA molecule. 

1 30. A kit for sequencing DNA by a chain termination 

2 method comprising a container which contains one or more 

3 deoxyribonuclecside triphosphates, a container which contains 

4 a chain elongation inhibitor, and an amount of a composition 

5 of the formula: 

R l 



6 
7 

R 3 



10 



19 



rM-x (U 



11 wherein: 

12 R 1 , R 2 , and R 3 may be the same or different and are 

13 independently selected from the group consisting of hydrogen, 

14 methyl, ethyl, and propyl, with the proviso that no more than 

15 two of R 1 , R 2 , and R 3 are hydrogen; and 

16 x is a moiety selected from the group consisting of: 

17 radicals of the formulas 

18 (a) -O ; and 



R 



4 



21 (b) -CH-(CH 2 ) n -R 5 

22 wherein: 



* 
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23 R 4 is selected from the group consisting of methyl 

24 and hydrogen and, vhen combined with R 1 , forms a 

25 pyrrolidine ring; 

26 R 5 is selected from the group consisting of -C0 2 H 

27 and -S0 3 H; and 

28 n is an integer of from 0 to 2; 

29 with the proviso that, when R 1 and R 4 form a pyrrolidine, 

30 ring, no more than one of R 2 and R 3 is hydrogen. 

1 31. The kit of claim 30 wherein R 1 , R 2 and R 3 are 

2 the same or different and selected from the group consisting 

3 of methyl, ethyl and hydrogen with the proviso that no more 

4 two of R 1 , R 2 and R 3 are hydrogen and, when R 1 and R 4 form a 

5 pyrrolidine ring, no more than one of R 2 and R 3 is hydrogen. 

1 32. The kit of claim 31 wherein X is -CHjCO^H. 

1 33. The kit of claim 32 wherein R 1 and R 2 are 

2 methyl and R 3 is hydrogen. 

1 34. The kit of claim 32 wherein R 1 is methyl and R a 

2 and R 3 are hydrogen. 

1 35. The kit of claim 32 wherein R 1 , R 2 and R 3 are 

2 methyl . 

1 36, The kit of claim 31 wherein X is -O. 

1 37. The kit of claim 36 wherein R 1 , R 2 and R 3 are 

2 methyl . 

1 38. The kit of claim 31 wherein R 1 and R 4 form a 

2 pyrrolidine ring, R 2 and R 3 are methyl, n is 0, and R 5 is 

3 -C0 2 H. 

1 39. The kit of claim 31 wherein R 1 , R 2 , and R 3 are 

2 methyl and X is -CHj-SOjH. 
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40. The kit of claim 30 further comprising an 
amount of a DHA polymerase enzyme. 
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